CDS

Accession Number TCMCG063C40066
gbkey CDS
Protein Id KAF7840429.1
Location complement(join(1591033..1591127,1591218..1591230,1591470..1591494,1591643..1591655,1591867..1591889,1592196..1592272,1592648..1592725,1592815..1592921,1594107..1594167,1594275..1594334,1595111..1595193,1595967..1596077,1596566..1596677,1596865..1596936))
Organism Senna tora
locus_tag G2W53_002727

Protein

Length 309aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA605066, BioSample:SAMN14013601
db_source JAAIUW010000002.1
Definition Pseudouridine-5'-phosphate glycosidase isoform C [Senna tora]
Locus_tag G2W53_002727

EGGNOG-MAPPER Annotation

COG_category Q
Description Indigoidine synthase A like protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction R01055        [VIEW IN KEGG]
KEGG_rclass RC00432        [VIEW IN KEGG]
RC00433        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K16329        [VIEW IN KEGG]
EC 4.2.1.70        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00240        [VIEW IN KEGG]
map00240        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCGTCTTCATCTCTCTCGCGGCTAACCAATCTTCGGCCACACTTGGATTTGGCTCACACGAACGCAAAGGATGCTGGAGTCTGTACCATTGGAGGGTCAATCAAGATAGCTTCAGAGGTTTCTCAAGCTTTATCACTCGGGCGTCCAGTCGTTGCTCTTGAATCCACTATAATCTCGCATGGGATGCCATATCCTAAAAATTTGCAAACTGCTAAAGAGGTGGAGGCTATTGTGAGGGAGAATGGAGCAGTTCCTGCAACTATTGCAATCTTAGATGGCACTCCCTGCGTAGGTTTAAGTACAGAAGACCTTGAAAGGCTAGCTATTCTGGGAACCAGAGCTCAGAAGACAGCTCGAAGGGATATTGCACATGTTGTGGCTAGTGGTGGGAATGGTGCTACTACTGTCTCTGCAACCATGTTTTTGGCTTCTATGGTTAATATTCCAGTCTTTGTGACTGGAGGAATTGGGGGAGTGCATAGACATGGAGAACATACTATGGACATTTCTTCTGACCTCACTGAGCTTGGAAGAACACCGGTAGCAGTTATATCTGCTGGAGTAAAATCAATATTAGATATCCCTAGGACCCTCGAGTATCTGGAAACACAGGGGGTTTGTGTCGCAGCCTACAAGACCAATGAGTTTCCTGCCTTTTTCACTGAATCTAGTGGTTGTAAGGTGCATTGTCGGGTAGATACTCCTGAAGACTGTGCTCGGCTAATAGGCAGGAAAAAATCAATTAGTCTTTTTCAGATAAAGAATATCTTTTTAGCGTCATTATTGATTACAGAGCATAAAACAAGCAATTGCCAATTAACTAAAAAAATATACATTGCACTTGTGAAGAATAATGCTCTTATTGGAGCTAAAGTTGCCGTAGCCCTTGCTCAGATCAGAGGACATTTTCCAAGATCATCTCTTTGA
Protein:  
MASSSLSRLTNLRPHLDLAHTNAKDAGVCTIGGSIKIASEVSQALSLGRPVVALESTIISHGMPYPKNLQTAKEVEAIVRENGAVPATIAILDGTPCVGLSTEDLERLAILGTRAQKTARRDIAHVVASGGNGATTVSATMFLASMVNIPVFVTGGIGGVHRHGEHTMDISSDLTELGRTPVAVISAGVKSILDIPRTLEYLETQGVCVAAYKTNEFPAFFTESSGCKVHCRVDTPEDCARLIGRKKSISLFQIKNIFLASLLITEHKTSNCQLTKKIYIALVKNNALIGAKVAVALAQIRGHFPRSSL